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Abstract 

Because there is considerable variation in gene expression even between closely related species, it is clear that gene 
regulatory mechanisms evolve relatively rapidly. Because primary sequence conservation is an unreliable proxy for 
functional conservation of c/s-regulatory elements, their assessment must be carried out in vivo. We conducted a survey of 
c/s-regulatory conservation between C. elegans and closely related species C. briggsae, C. remanei, C. brenneri, and C. 
japonica. We tested enhancers of eight genes from these species by introducing them into C. elegans and analyzing the 
expression patterns they drove. Our results support several notable conclusions. Most exogenous cis elements direct 
expression in the same cells as their C. elegans orthologs, confirming gross conservation of regulatory mechanisms. 
However, the majority of exogenous elements, when placed in C. elegans, also directed expression in cells outside 
endogenous patterns, suggesting functional divergence. Recurrent ectopic expression of different promoters in the same C. 
elegans cells may reflect biases in the directions in which expression patterns can evolve due to shared regulatory logic of 
coexpressed genes. The fact that, despite differences between individual genes, several patterns repeatedly emerged from 
our survey, encourages us to think that general rules governing regulatory evolution may exist and be discoverable. 
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Introduction 

A complex network of molecular interactions that orchestrates 
gene expression provides multiple sources for regulatory variation 
between species [1]. Changes in transcriptional regulation can 
occur in two fundamentally different ways: in trans regulators [2,3], 
for example through changes in protein sequences or expression 
patterns of transcription factors, or in cis elements via changes in 
identity or location of transcription factor binding sites [4,5]. 
Although the importance of variation in gene regulation for 
evolution is well appreciated [6-8], many details remain to be 
elucidated. For example, do mutations in cis arise and go to 
fixation more frequently than changes in trans [9,10]? Are 
regulatory mutations pleiotropic and, if so, what are their effects 
[1 1]? Our research has focused on cif-regulatory elements (CREs). 
These sequences consist of multiple transcription factor binding 
sites and a core promoter, but these motifs tend to be short, diffuse, 
and flexible in their locations [12]. Traditional sequence 
alignments may not therefore be reliable indicators of functional 
conservation [13]. Because cis elements integrate signals from 
multiple trans-acting factors in the context of an intact cell, their 
functions have to be assessed in vivo [14]. 

The study of functional evolution of ar-regulatory elements has 
relied on two approaches. One typically starts with the knowledge 
of the location of binding sites in a regulatory sequence of one 
species and is followed up by the functional tests of these binding 



sites in the original and other species [15,16]. This approach is 
labor-intensive and is more difficult to scale. An alternative 
consists of assessing the functions of orthologous regulatory 
sequences, without detailed knowledge of identity and location 
of binding sites, from multiple species in the same taHj-regulatory 
environment (reviewed in [17]). This approach has the advantage 
of being applicable to less well-studied regulatory regions and can 
be scaled up to multiple genes, allowing researchers to infer 
general rules of regulatory sequence evolution. 

Because they often use different methodologies and criteria for 
comparisons, studies that investigate the regulatory evolution of 
individual genes are not easily comparable. It has therefore been 
difficult to generalize results and infer common features of as- 
regulatory evolution. Still, several trends are evident. Multiple 
studies documented divergence [18-22] and constraint [23-25] in 
fir-regulatory mechanisms between species. While functionally 
equivalent enhancers in different species are often found in similar 
locations [26,27], this is not always the case [28-30]. In some 
cases, differences in fir-regulatory mechanisms reflect divergence 
in endogenous expression patterns [30,31]. In others, divergent 
regulatory mechanisms underlie overdy conserved endogenous 
expression patterns [32-34], suggesting compensatory changes in 
cis and in trans [17,22,35]. 

In this study, we aimed to survey the amount of functional 
variation that exists in gene regulatory elements of closely related 
species. C. elegans offers an attractive model system for this work 
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Author Summary 

Given the importance of gene expression changes in 
evolution, a better understanding of how they accumulate 
is desirable. However, gene regulation is a complex 
biochemical process and it is not a priori clear whether 
general trends even exist. We systematically addressed this 
question by testing, in C. elegans, the functions of 
regulatory elements of eight different genes from four 
other nematodes. We saw rampant variation in gene 
regulatory mechanisms, even between closely related 
species. While the differences were usually seen in a 
relatively small number of cells, there was a discernible 
trend - there were many more instances of gain, rather 
than loss of expression, compared to patterns directed by 
the C. elegans cis elements. Finally, the recurrence of 
ectopic expression in the same cells suggests that the 
paths open to evolution may be constrained by the 
composition of regulatory elements. We view these 
patterns as a reflection of general mechanisms of gene 
regulatory evolution and suggest that these can be 
refined, and others discovered, using systematic functional 
tests. 



because of its simple and invariant anatomy [36,37], which is 
conserved with close relatives [38]. The ease of describing gene 
expression with a single-cell resolution permits more precise 
comparisons than those possible in other multicellular model 
systems. Cu-regulatory sequences are often located within 1 kb 
upstream of the translation start site [39]. Several species from the 
Caenorhabditis genus that are approximately as divergent as 
human and mouse [40] are routinely used for comparisons with 
C. elegans. 

We selected eight genes from five Caenorhabditis species that 
have available genome sequences: C. elegans, C. briggsae, C. remanei, 
and C. brenneri, the latter three equidistant to C. elegans; and C. 
japonica, a more distandy related species. In all cases, orthologous 
regulatory sequences were cloned, and the expression patterns 
they drove were evaluated in the C. elegans teni-regulatory 
environment. We report several general trends of m-regulatory 
divergence gleaned from these observations. 

Results 

Rationale and approach 

The goal of this study is essentially comparative, that is, to test 
whether orthologous cis elements are functionally equivalent. Our 
work is part of a broader research program aiming to investigate 
functional divergence of gene regulatory systems [41]. In this study 
we introduced as-regulatory sequences (fused to GFP reporters) 
from several Caenorhabditis species into C. elegans and compared 
their expression patterns to those of their C. elegans orthologs. This 
approach can be seen as an extension of a fruitful paradigm that 
analyzes gene expression in hybrid organisms [21,42-44]. In our 
experiments the "hybrid" portions of the genome range from a 
few hundred to a few thousand nucleotides directing gene 
expression. 

While it is certainly desirable to document endogenous gene 
expression patterns and uncover all regulatory elements required 
to direct them, these questions remain outside the scope of our 
experimental program. Instead, our goal is to assess functional 
conservation of cw-regulatory sequences. To do so, we need only to 
ascertain whether of elements from different species direct the 
same or different expression patterns. To ensure comparability, 



only the sequences from the immediately upstream regions were 
considered; consequently, if some regulatory sequences are located 
in introns, transgenes may not recapitulate the entire endogenous 
expression patterns. Movements of of elements between the 
upstream intergenic regions in one species and introns in another, 
dubbed "nomadic" enhancers [30], illustrate one type of 
regulatory divergence our approach can uncover. Due to the 
persistence of the GFP protein, we are unlikely to detect minor 
dynamic differences in expression patterns. Testing all cis- 
regulatory elements in the common /ranj-regulatory environment 
of C. elegans simplifies the interpretation of these comparative data 
- any difference in expression patterns, whether gain or loss, 
reveals functional divergence between orthologous ci-regulatory 
elements, regardless of the expression patterns driven by these 
sequences in their endogenous franj-regulatory environments. 

Selection of species and genes to be tested 

In addition to C. elegans, we selected for our study four species 
with sequenced genomes: C. briggsae, C. remanei, C brenneri, and C. 
japonica [45,46]. We decided to focus on these species because, 
based on previous experience [19,22,47-51], we anticipated many 
a'j-regulatory functions to be substantially conserved. Given the 
established phylogenetic relationships between these five species 
[52], our experiments interrogated the extent of functional 
divergence accumulated over two time scales - one between C. 
elegans and the equidistant C. briggsae/ C. remanei/ C brenneri, and 
another between C. elegans and a more distant C. japonica (Figure 1). 
Estimates suggest that the phylogenetic distance between the latter 
pair of species is comparable to that within the Sophophora 
subgenus of Drosophila [40,52] or vertebrate classes [53]. While 
the phylogeny is well-resolved, the paucity of fossil Rhabditidae 
nematodes [54] precludes a reliable estimate of the age of species 
divergence. 

We focused on eight genes expressed in relatively small groups 
of easily identifiable cells. Three genes are terminal effectors of the 
GABAergic fate: unc-25 [55], unc-46 [56], unc-47 [57], and are thus 
expressed in all GABAergic neurons. Two other genes, oig-1 and 
acr-14 [58], are thought to be expressed in subsets of GABAergic 
neurons. We chose these five coexpressed and partially coregu- 
lated [58] genes to test whether shared regulation imposes 
particular constraints on their evolution. To offset this bias to a 
particular class of neurons, we added two genes expressed in other 
neuronal types - one expressed in amphid (chemosensory) 
neurons, gpa-5 [59], and one expressed in serotonergic neurons, 
mod-5 [60]. The pattern of serotonergic neurons is conserved 
between C. elegans, C. briggsae and C. remanei [61]; the pattern of 
GABAergic neurons is conserved between C. elegans and C. briggsae 
[22], as well as with C. remanei and C. brenneri (AB & IR, 
unpublished data). Finally, we included one gene expressed outside 
the nervous system, kat-1, which encodes a conserved thiolase [62] 
involved in a fat storage pathway [63] . 

The protein-coding sequences of all eight genes are highly 
conserved (Figure 1). Moreover, the synteny with the immediate 
upstream genes is conserved among all five species (Figure SI), 
making us confident that all of them are single-copy, one-to-one 
orthologs of the C. elegans genes. We tested the entire intergenic 
regions containing putative cis elements to ensure that comparisons 
indeed included orthologous regulatory sequences. 

In contrast with the high conservation of coding sequences, the 
noncoding upstream regions (which we assume to contain the 
majority of CREs [39]) are much more variable. We aligned 
orthologous intergenic sequences upstream of C. briggsae, C. remanei, 
C. brenneri, and C. japonica to their C. elegans counterparts and 
visualized the results using software package VISTA [64]. The 
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Figure 1. Species and genes included in this study. Phylogenetic relationship of the five studied species. Numbers represent relative 

conservation of protein sequences (compared to C. elegans) based on the BLOSUM matrix. 

doi:10.1371/journal.pgen.1004435.g001 



CREs of unc-46, acr-14, and unc-47 showed relatively high levels of 
conservation, spanning — 150 to 300 nucleotides in most or all 
species (Figures 2A-4A). The CREs of kat-1 and unc-25 displayed 
somewhat lower conservation, although some blocks of high 
similarity could still be clearly identified (Figures 5A, 6A). The 
CREs of gpa-5, oig-1, and mod-5 had little obvious evidence of 
conservation in the proximity of the translation start site 
(Figures 7A-9A), although some regions of putative conservation 
were present substantially upstream of these genes. Sequence 
comparisons within non-coding regions are notoriously challeng- 
ing because we do not understand the "rules" by which these 
sequences evolve [1]. Therefore, we considered two additional 
measures of sequence divergence, namely the length of the longest 
contiguous sequence that is perfectly conserved between orthologs 
and the number of nucleotides contained within blocks of perfect 
conservation of 7 bp and longer. By both of these measures, cis 
elements of unc-46 and acr-14, and to some extent unc-47, appear 
to be more conserved than those of the rest of the genes included 
in this study (Table SI). Next we tested functional conservation of 
these regulatory elements. In all experiments we used sequences 
upstream of translation start sites, thus making translational fusion 
genes, to ensure that the tested regions encompass basal promoters 
and more distal regulatory sequences. 

Pervasive functional divergence in cis elements 

Expression patterns directed in C. elegans by the orthologous cis 
elements of the eight studied genes were largely similar (Figures 2— 
9; detailed descriptions of the observed patterns are presented in 
Text SI). However, patterns driven by heterologous CREs were 
indistinguishable from those directed by their C. elegans orthologs 
in only three instances: C. brenneri unc-25 (Figure 6B), C. remanei gpa- 
5 (Figure 7B), and C. brenneri mod-5 (Figure 9B). In the rest of the 
cases, the expression patterns of heterologous CREs differed from 
their C. elegans counterparts. Some failed to direct expression in 
some of the cells in which C. elegans cis elements were active, others 
drove expression in additional cells. For reasons of brevity, in the 
following we will refer to the former as "losses" and to the latter as 
"gains" or ectopic expression, without the implication that these 
reflect differences in endogenous expression patterns. They do, 
however, reveal instances of divergence of the regulatory 
mechanisms controlling expression of orthologous genes in the 
examined species. 

"Losses" of expression in the endogenous pattern typically 
affected single cell types. In two cases (unc-46 and unc-25; 
Figures 2B and 6B), the expression patterns driven by the C. 
elegans CREs were completely recapitulated by all heterologous 
CREs. In three instances (unc-47, gpa-5, and oig-1; Figures 4B, 7B, 
and 8B), while the patterns were qualitatively conserved, portions 
directed by one or more heterologous CREs were markedly 
decreased, in frequency or intensity. For example, the C. remanei cis 
element of unc-47 drives weak and inconsistent expression in 



the neuron RIS (Figure 4B), the C. briggsae and C. brenneri CREs of 
gpa-5 direct weak and inconsistent expression in AWAL/R 
(Figure 7B), and the C. remanei, C. brenneri and C. japonica CREs 
of oig-1 are expressed inconsistendy in DVB (Figure 8B). The C. 
japonica CRE of acr-14 fails to direct expression in several cell types 
in the ventral nerve cord, only maintaining expression in D-type 
neurons, while expression in AVAL/R is much weaker than with 
other species' CREs (Figure 3B). The C. remanei and C. brenneri 
CREs of kat-1 fail to drive expression in the gonadal sheath 
(Figure 5B), the somatic tissue enveloping the proximal gonad. In 
the most severe case, mod-5, the CREs from C. briggsae and C. 
remanei only support expression in ADFL/R (Figure 9B). 

In addition to "losses" of expression in subsets of endogenous 
patterns, most heterologous cis elements also drove ectopic 
expression. Indeed, only six tested CREs did not show any 
evidence of "gain" of expression: C. remanei unc-47 (Figure 4B), C. 
brenneri unc-25 (Figure 6B), C. remanei gpa-5 (Figure 7B), and all three 
heterologous cis elements of mod-5 (Figure 9B). Ectopic expression 
was seen in as few as one and as many as five different cell types, 
depending on the gene. In some cases, this expression was driven 
in the same cells or tissues by all heterologous CREs of a given 
gene: unidentified lateral ganglion neurons in the head (unc-46, 
Figure 2B), AVnL/R neurons in the lateral ganglion (acr-14, 
Figure 3B), and head muscles (kat-1, Figure 5B). In other instances, 
only some of the orthologous elements directed co-occurring 
expression: HSNL/R for unc-46 (Figure 2B), hypodermis for kat-1 
(Figure 5B), DVB for gpa-5 (Figure 7B), and ADEL/R, PDEL/R, 
HSNL/R with oig-1 (Figure 8B). 

The results described above reveal pervasive divergence in cis- 
regulatory function. However, divergence can also stem from 
changes in trans regulators [2,3]. To test whether the trans 
environments were functionally equivalent between species, we 
compared spatial expression patterns driven by four C. briggsae 
CREs in C. elegans and C. briggsae. Although expression patterns 
generated by these sequences were qualitatively similar between 
the two species, in every instance there were reproducible 
differences as well (Figure S2). These results further reinforce the 
notion that divergence has taken place in both cis- and tow- 
regulatory mechanisms. 

Discussion 

We carried out functional comparisons of orthologous regula- 
tory elements of eight genes from Caenorhabditis nematodes. Our 
experimental paradigm, placing orthologous cis elements into the 
common fran^-regulatory environment of C. elegans, allows infer- 
ences to be made about the extent of functional divergence 
between C. elegans CREs and their orthologs from other species. 
Because we selected genes expressed in relatively simple patterns, 
we were able to detect even subtle differences. Our results support 
four notable conclusions. 
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Figure 2. Functional conservation and divergence of unc-46 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of unc-46, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of unc-46. For all cells, frequency of expression is indicated, except for D-type neurons for which the median number of expressing cells in 
shown. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: RMEs(RMED/V/L/R), SIADs (SIADL/R), 
OLQDs (OLQDL/R), Lat. gang, (unidentified pair of neurons in the lateral ganglion), HSNs (HSNL/R). Detailed data are shown in Table S3. 
doi:1 0.1 371 /journal. pgen.1004435.g002 



Divergence is pervasive 

Most of the orthologous cis elements we analyzed directed 
patterns of expression in C. elegans that either substantially or 
completely matched the expression patterns of the orthologous C. 
elegans CREs (Figures 2-8; with the possible exception of mod-5, 
Figure 9). This result, supported by 30 transgenes, suggests that the 
mechanisms controlling orthologous gene expression are largely 
conserved among the studied species. Yet, in the vast majority of 
these cases (27/30), ordiologous CREs directed expression patterns 
that differed from their C. elegans counterparts. These differences 
were fairly subtle, typically affecting only a few cells, as previously 
reported in other species [65-67] highlighting the value of detailed, 



focused, multi-gene analyses to reveal trends. Differences in the 
lengths of tested cis elements did not appear to correlate with the 
observed differences in expression patterns (Figure S3). 

We observed "losses", as well as "gains" of expression, as 
compared to the patterns generated by the C. elegans CREs. Even 
cis elements from two closely related species, C. briggsae and C. 
remanei, often differed in the expression patterns they directed, 
indicating that divergence could accumulate relatively quickly. 
Because in most instances it is difficult to establish the precise 
endogenous expression patterns of the genes, the observed 
differences either reflect lineage-specific changes in gene expres- 
sion or divergence in the mechanisms that regulate conserved 
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Figure 3. Functional conservation and divergence of acr-14 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of acr-14, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of acr-14. For all cells, frequency of expression is indicated, except for the ventral nerve cord (VNC) for which the median number of expressing 
cells in shown. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: AVAs (AVAL/R), AFDs (AFDL/ 
R), CEPs (CEPDA/ L/R), AVns (AVHL/R or AVJL/R or AVDL/R), PVNs (PVNL/R). Reductions of expression compared to the endogenous pattern are circled. 
Detailed data are shown in Table S4. 
doi:1 0.1 371 /journal.pgen.1 004435.g003 



expression. In several cases, however, compelling indirect evidence 
points to the latter scenario. 

Three of the eight genes in this study, unc-25, unc-46, and unc-47, 
are terminal effectors of the GABAergic neuronal fate. Immuno- 
staining for GABA in C. elegans [68], Ascaris suum [69], and C. 
briggsae and C. remanei (AB & IR, unpublished data) revealed very 
similar patterns. Furthermore, the expression driven by the C. 
briggsae unc-47 CRE in its endogenous fraas-regulatory environment 
is identical to that driven by the C. elegans unc-47 CRE in C. elegans 
[22]. Similarly, patterns of immunostaining for serotonin in C. 
elegans, C. briggsae, and C. remanei were identical [61,70]. These 



results suggest that the number and relative position of GABAergic 
and serotonergic neurons, and thus the expression patterns of key 
genes defining these neuronal fates (the three GABA genes above 
and mod-5), are conserved among these Caenorhabditis nema- 
todes. Thus, differences in cis regulatory elements of these four 
genes (Figures 2B, 4B, 6B, 9B) likely reveal changes in the specific 
ways in which these conserved expression patterns are encoded. 
This interpretation stresses noticeable divergence in gene regula- 
tion even between closely related lineages, consistent with what has 
been seen in others species [71,72]. This view suggests that 
changes in fr<ro-regulatory mechanisms and ctj-regulatory ele- 
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Figure 4. Functional conservation and divergence of unc-47 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of unc-47, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of unc-47. For all cells, frequency of expression is indicated, except for D-type neurons for which the median number of expressing cells in 
shown. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: RMEs (RMED/V/L/R), SIADs (SIADL/R), 
CEPs (CEPDA/ L/R), SDQs (SDQL/R), PVNs (PVNL/R). It is unclear whether expression in the SIADs is endogenous [56,57,68]. However, since it is 
consistently seen with the C. elegans CRE, we included it in the endogenous pattern. We classified the strong expression of the C. briggsae unc-47 CRE 
in SDQL/R as ectopic, even though weak SDQR expression was observed with the C. elegans CRE, because of the dramatic differences in the 
frequency and intensity of expression [22]. Reduction and losses of expression compared to the endogenous pattern are circled. Detailed data are 
shown in Table S5. 

doi:10.1371/journal.pgen.1004435.g004 



merits accumulate in a somewhat compensatory fashion to ensure 
that the overall expression patterns of genes remain conserved 
[22,35,42,73]. The different expression patterns of four C. briggsae 
CREs in C. elegans and C. briggsae (Figure S2) support the idea that 
teni-regulatory divergence is prevalent. 

Sequence conservation and functional divergence 

Consistent with previous reports [73-75], we saw no obvious 
correspondence between the extent of large-scale sequence 



conservation and functional conservation. For example, while 
the CREs of unc-25 and oig-1 show relatively scant primary 
sequence conservation, their functions appear to be conserved no 
less well (Figures 6, 8) than those of genes with apparendy greater 
sequence conservation (e.g. une-46, Figure 2). Sequence compar- 
isons in noncoding regions, particularly when these are of different 
length, are notoriously challenging. Other metrics of sequence 
similarity, like the portion of the CRE that is conserved, also failed 
to reveal a discernible relationship to functional conservation 
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Figure 5. Functional conservation and divergence of kat-1 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of kat-1, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of kat-1. Frequency of expression in different tissues is shown: Pha. (pharynx), Int. (intestine), Gon. sheath (gonadal sheath), Head muse, (head 
muscles), Hypod. (hypodermis), Head neur. (head neurons), VNC (ventral nerve cord). Losses of expression compared to the endogenous pattern are 
circled. Detailed data are shown in Table S6. 
doi:1 0.1 371 /journal.pgen.1 004435.g005 



(Figure S3, Table SI). We also tested shorter cis elements oimod-5 
and unc-25 that excluded the majority of conserved sequence 
blocks; their expression patterns were qualitatively similar to those 
of their longer counterparts (data not shown). These findings are 
consistent with previous reports that conserved expression patterns 
can be driven by highly divergent regulatory elements [76-83]. 
Previous research suggested that at least in some instances, long 
tracts of conserved sequences in cis elements may reflect particular 
features of regulatory organization, rather than unusually stringent 
selection for the maintenance of expression patterns [84] . 

Collectively, these results suggest that we may need to 
reevaluate a common reliance on large-scale sequence conserva- 
tion when using comparative sequence data to identify 



«'.r-regulatory elements. Presence or absence of transcription 
factors binding sites, their arrangement and spacing may be more 
informative, although harder to detect [73,85-89]. 

We did not detect greater functional divergence of CREs from 
the more distant C. japonica compared to C. briggsae, C. remanei, and 
C. brenneri. Among the six genes that have been tested from all four 
of these species, C. japonica cis elements show approximately the 
same number of "gains" and "losses" as their orthologs from other 
species (Table S2). It is possible that the ~2-fold difference in the 
phylogenetic distance [40] separating, on the one hand, C. elegans 
and C. japonica and, on the other hand, C. elegans and C. briggsae/ C. 
remanei/ C. brenneri, does not offer enough power to test this 
hypothesis. Examining more distantly related pairs of species may 
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Figure 6. Functional conservation and divergence of unc-25 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of unc-25, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of unc-25. For all cells, frequency of expression is indicated, except for D-type neurons for which the median number of expressing cells in 
shown. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: RMEs (RMED/V/L/R), SIADs (SIADL/R). 
Detailed data are shown in Table S7. 
doi:1 0.1 371 /journal.pgen.1 004435.g006 



be required. Finally, the complexity of the expression pattern of a 
gene does not seem to be correlated with the amount of functional 
divergence in its cis element (Figure S3). 

"Gains" are more common than "losses" 

One striking pattern evident in our results is that a substantial 
majority of functional differences between orthologous cis elements 
is due to "gain", rather than "loss" or reduction, of expression 
relative to the pattern directed by the C. elegans CREs. Put another 
way, when tested in C. elegans, heterologous regulatory elements 
more commonly directed expression in more rather than fewer 
cells, compared to the C. elegatis-driven patterns. When all 



experiments reported here are considered together, the total 
number of "gains" was nearly three-fold higher than the number 
of "losses" (51 vs. 18). Even when minor differences in patterns are 
counted as "losses", their number (23) is still less than half than 
that of "gains" (51). This phenomenon does not appear to be due 
to greater power to detect "gains" compared to "losses" (Figure 
S4). Restricting comparisons only to those genes for which all four 
non-C. elegans species were tested, does not substantially alter this 
conclusion (12 vs. 44 or 16 vs. 44, if "losses" are counted more 
liberally). Therefore, our results suggest that the two regulatory 
modalities, namely one directing expression in certain cells and 
another repressing inappropriate expression, evolve at different 
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Figure 7. Functional conservation and divergence of gpa-5 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of gpa-5, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C.japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), and C. brenneri (Cbn) CREs of gpa-5. For 
all cells, frequency of expression is indicated. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: 
AWAs (AWAL/R), MCs (MCL/R), PVNs (PVNL/R), PVQs (PVQL/R). Reductions of expression compared to the endogenous pattern are circled. Detailed 
data are shown in Table S8. 
doi:1 0.1 371 /journal.pgen.1 004435.g007 



rates. The molecular mechanisms and evolutionary forces that 
could account for this observation remain to be investigated. It is 
possible, however, that the positive and negative regulatory aspects 
of gene regulation evolve under different regimes, because of the 
difference in the ways in which they are encoded within of 
elements. 

Recurrent divergence patterns suggest developmental 
bias in evolutionary trajectories 

The relatively large number of cases in which heterologous cis 
elements directed ectopic expression when in C. elegans, allowed 
us to investigate whether these "gains" followed a pattern. 
Notably, for the neuronal genes unc-46, acr-14, unc-47, unc-25, 
oig-1, and gpa-5, nearly all "gains" occurred in neurons (Figures 2— 
4, 6-8). This tropism suggests that the regulatory architecture of 
neuronal CREs - some transcriptional inputs are pan-neuronal in 
nature [90,91] - may restrict ectopic expression to neurons. We 
further noted that in several instances, CREs of different genes or 



from different species directed ectopic expression in the same cells 
(Figure 10). The cells "gaining" expression do not appear to be 
transcriptionally promiscuous, because ectopic expression is seen 
in several different cells not previously noted for indiscriminate 
expression (Text SI). Furthermore, the "gain" of expression is not 
likely to be due to effects of vector sequences. We used a standard 
vector utilized by us and others thousands of times. Previous 
studies using this vector documented ectopic expression in the 
intestine and pharynx [49,92] , not specific subsets of neurons, as 
we reported here. Instead, we favor a hypothesis that the cis 
elements themselves could be sharing certain characteristics that 
make them more likely to direct expression in particular cells. The 
recurrent "gains" of expression were seen for unc-46, acr-14, unc- 
47, and oig-1, which are coexpressed in a subset of GABAergic 
neurons and are know to be coregulated by at least one 
transcription factor, UNC-30 [58]. It is therefore plausible that 
these cis elements share some features, for example transcription 
factor binding sites or general organization, and that this similarity 
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Figure 8. Functional conservation and divergence of oig-1 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of oig-1, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C. japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), C. brenneri (Cbn), and C. japonica (Cja) 
CREs of oig-1. For all cells, frequency of expression is indicated, except for D-type neurons for which the median number of expressing cells in shown. 
For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: Head neurons (large cluster of head neurons, 
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may bias the trajectories that evolution could follow [15]. This 
may in part account for the commonly observed instances of 
parallel evolution [33,93-95]. 

With this survey, we established several trends of functional 
conservation and divergence of cz'j-regulatory elements. We found 
pervasive functional divergence in transcriptional regulatory mech- 
anisms, both in cis and in trans. More strikingly, we identified inherent 
biases in the nature and functional consequences of this divergence, 
hinting at possible mechanisms underlying repeated evolution. 



Materials and Methods 

Cloning of c/s-regulatory elements 

Putative m-regulatory elements (extending from the first exon to 
the nearest upstream gene) were PCR amplified from genomic 
DNA using Phusion polymerase and cloned upstream of GFP into 
the pPD95.75 plasmid, routinely used for analysis of gene 
expression in C. elegans [96]. Cloned fragments were sequenced 
to ensure accuracy. C. elegans CREs of unc-46, acr-14, kat-1, unc-25, 
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Figure 9. Functional conservation and divergence of mod-5 regulation. (A) Vista plots represent primary sequence conservation in the 
intergenic region upstream of mod-5, relative to C. elegans. Window size = 20 bp, threshold: 70%. From top to bottom: C. briggsae, C. remanei, C. 
brenneri, C.japonica. (B) Expression patterns driven by the C. elegans (Cel), C. briggsae (Cbr), C. remanei (Cre), and C. brenneri (Cbn) CREs of mod-5. For 
all cells, frequency of expression is indicated. For groups of multiple cells, percentages represent frequency of expression in at least one of these cells: 
ADFs (ADFL/R), AIMs (AIML/R). Losses of expression compared to the endogenous pattern are circled. Detailed data are shown in Table S10. 
doi:1 0.1 371 /journal.pgen.1 004435.g009 



and oig-1, were also cloned into the plasmid HYM153 (kind gift of 
H.-Y. Mak) upstream of the mCherry reporter gene as controls. 

Strains 

C. elegans transgenic lines were established by injecting into pha- 
l(e2123) worms cocktails consisting of 5 ng/|xL CRE::GFP 
reporter constructs with 5 ng/|lL rescue plasmid [97] and 
100 ng/uE salmon sperm DNA; this is thought to facilitate the 
formation of complex transgenic constructs as extrachromosomal 
arrays [98]. For five genes (unc-46, acr-14, kat-1, unc-25, and oig-1), 
plasmids carrying C. elegans CREs fused to mCherry were 
coinjected with the plasmids carrying orthologous CREs from 
each of the five species fused to GFP. C. briggsae transgenic lines 
were established by injecting cocktails consisting of 5 ng/uE 
CRE::GFP reporter constructs with 5 ng/u\L rescue plasmid and 
100 ng/uL salmon sperm DNA into Cbr-unc-119 (nm67) worms 
[99]. 



Microscopy 

Mixed-stage populations of transgenic worms were grown with 
abundant food and L4-stage larvae or young adults were selected. 
These were immobilized on agar slides with 1 0 mM sodium azide 
in M9 buffer. The slides were examined on a Leica DM5000B 
compound microscope under 400-fold magnification. Worms 
without any visible GFP expression were assumed to have lost 
the transgene. Each photograph showing worms in figures is 
composed of several images of the same individual capturing 
anterior, middle, and posterior sections. 

Analysis 

At least fifty individuals from no fewer than two independent 
strains were analyzed for each transgene. The plasmid pPD95.75 
has been used extensively by the C. elegans community over the last 
two decades. It has been reported to direct low-level background 
expression in the pharynx and anterior and posterior intestine 
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Figure 10. Recurrent "gains" of expression by different CREs. CREs of acr-14 from C. brenneri (Figure 3B) and unc-47 from C. japonica 
(Figure 4B) drive expression in CEP neurons in the head. CREs of unc-46 from C. brenneri and C. japonica (Figure 2B) and oig-1 from C. briggsae, C. 
remanei and C. brenneri (Figure 8B) drive expression in HSN neurons in the mid-body. CREs of acr-14 from C. remanei (Figure 3B) and unc-47 from C. 
japonica (Figure 4B) drive expression in PVN neurons in the tail. 
doi:10.1371/journal.pgen.1004435.g010 



[49,92,96]. We have previously reported that extrachromosomal 
arrays direct expression patterns that are concordant with those of 
integrated and single-copy transgenes [22,100]. Still, to obtain 
conservative estimates of expression differences between CREs 
from C. elegans and other species, we only counted discrepancies 
(missing or extra expression) observed in two or more strains. Data 
on consistency of expression patterns between strains and 
individuals are presented in Tables SI, S2, S3, S4, S5, S6, S7, 
S8 and S10. 

Supporting Information 

Figure SI Synteny is conserved across all five species for the 
eight genes studied. Schematic representation of synteny and 
intergenic distances for unc-46, acr-14, unc-47, kat-1, unc-25, gpa-5, 
oig-1, and mod- 5. In each set, from top to bottom: C. elegans, C. 
briggsae, C. remanei, C. brenneri, C. japonica. 
(PDF) 

Figure S2 Divergence in fraro-regulatory mechanisms. (A— D) 
Comparisons of the expression patterns driven in C. elegans and C. 
briggsae by CREs of (A) C. briggsae unc-46, (B) C. briggsae unc-25, (C) 
C. briggsae gpa-5, (D) C. briggsae oig-1. Abbreviations of cell names 
and the meaning of values are the same as in corresponding 
Figures 2B, 6B, 7B, and 8B. Detailed data are shown in Table SI 1. 
(PDF) 

Figure S3 Functional divergence does not correlate with 
complexity of expression patterns or primary sequence conserva- 
tion. (A) Complexity of expression pattern, measured as the 
number of endogenously expressing cell types, does not correlate 
with functional divergence of cu-regulatory elements, as measured 
by differences (expressing cell types) of C. elegans and orthologous 
CREs. (B) Primary sequence conservation, as measured by the 
fraction of CRE sequences contained in conserved blocks of 20 
nucleotides or more, does not correlate with functional divergence 
of or-regulatory elements. (C) Primary sequence conservation does 
not correlate with complexity of expression patterns. (D) 
Difference in length of CRE sequences does not correlate with 
functional divergence. Each data point represents a single cis- 
regulatory element; all comparisons are to C. elegans. 
(PDF) 



Figure S4 "Gains" of expression are more frequent than 
"losses." The curves represent sorted frequencies of "losses" 
of expression along the endogenous pattern (blue) and "gains" of 
expression (pink). Frequency of "loss" refers to frequency of 
endogenous cells not expressing a heterologous transgene. 
Frequency of "gain" refers to frequency of expression in non- 
endogenous cells. For example, a frequency of 20% "loss" refers to 
80% of transgenic individuals showing expression in a particular 
cell type, whereas 20% "gain" indicates that 20% of transgenic 
individuals show ectopic expression in a particular cell type. Since 
expression in the ventral nerve cord was measured as a median, 
and not a frequency, this plot does not include ventral nerve cord 
data. For every possible frequency threshold below 100%, 
instances of "gain" outnumber instances of "loss." 
(PDF) 

Table SI Conservation of primary sequence in CREs between 

C. elegans and C. briggsae. 

(XLSX) 

Table S2 "Gains" and "losses" of expression relative to C. 

elegans. 

(DOC) 

Table S3 Expression patterns of unc-46 cis elements. Raw data 

for expression patterns reported in Figure 2. 

(XLS) 

Table S4 Expression patterns of acr-14 cis elements. Raw data 

for expression patterns reported in Figure 3. 

(XLS) 

Table S5 Expression patterns of unc-47 cis elements. Raw data 

for expression patterns reported in Figure 4. 

(XLS) 

Table S6 Expression patterns of kat-1 cis elements. Raw data for 
expression patterns reported in Figure 5. Expression is counted as 
present or absent in a specific cell type. 
(XLS) 

Table S7 Expression patterns of unc-25 cis elements. Raw data 

for expression patterns reported in Figure 6. 

(XLS) 
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Table S8 Expression patterns oigpa-5 cis elements. Raw data for 

expression patterns reported in Figure 7. 

(XLS) 

Table S9 Expression patterns of oig-1 cis elements. Raw data for 

expression patterns reported in Figure 8. 

(XLS) 

Table S10 Expression patterns of mod-5 cis elements. Raw data 

for expression patterns reported in Figure 9. 

(XLS) 

Table Sll Expression patterns of C. briggsae transgenes in C. 
briggsae. Raw data for expression patterns reported in Figure 
S2. 
(XLS) 



Text SI Detailed description of Figures 2-9 and supplemental 

discussion. 

(DOC) 

Acknowledgments 

We are grateful to Aime Agather, Kelsie Eichel, Andrew Gehrke, and 
Zhengying He for excellent technical help and Ho Yi Mak for sharing 
reagents. We thank Robert Arthur, Kacy Gordon, Manu, John Reinitz, 
and Chung-I Wu for critical reading of the manuscript and insightful 
suggestions, and members of the Ruvinsky laboratory for helpful 
discussions. 

Author Contributions 

Conceived and designed the experiments: AB IR. Performed the 
experiments: AB. Analyzed the data: AB IR. Wrote the paper: AB IR. 



References 

1. Wray GA, Hahn MW, Abouhcif E, BalhoffJP, Pizcr M, ct al. (2003) The 
evolution of transcriptional regulation in eukaryotes. Mol Biol Evol 20: 1377- 
1419. doi:10.1093/molbev/msgl40. 

2. Lynch VJ, Wagner GP (2008) Resurrecting the role of transcription factor 
change in developmental evolution. Evolution 62: 2131-2154. doi: 10.1111/ 
j.l558-5646.2008.00440.x. 

3. Hsia CC, MeGinnis W (200.3) Evolution of transcription factor function. Curr 
Opin Genet Dcv 13: 199-206. doi:10.1016/S0959-4.37X(03)00017-0. 

4. Tuch BB, Galgoczy DJ, Hernday AD, Li H, Johnson AD (2008) The evolution 
of combinatorial gene regulation in fungi. PLoS Biol 6: e38. doi: 10.1371/ 
jouraal.pbio.0060038. 

5. Wray GA (2007) The evolutionary significance of cis-rcgulatory mutations. Nat 
Rev Genet 8: 206-216. doi:10.10.38/nrg2063. 

6. Carroll SB (2000) Endless forms: the evolution of gene regulation and 
morphological diversity. Cell 101: 577-580. 

7. Brcm RB, Yvcrt G, Clinton R, Kruglyak L (2002) Genetic dissection of 
transcriptional regulation in budding yeast. Science 296: 752-755. doi: 10. 1 126/ 
science. 10695 16. 

8. Stern DL (2010) Evolution, development, and the predictable genome. 1st ed. 
Roberts & Company Publishers. 

9. Wittkopp PJ (2005) Genomic sources of regulatory variation in cis and in trans. 
Cell Mol Life Sci 62: 1779-1783. doi: 10.1 007 /s000 18-005-5064-9. 

10. Grubcr JD, Vogel K, Kalay G, Wittkopp PJ (2012) Contrasting properties of 
gene-specific regulatory, coding, and copy number mutations in Saccharomyccs 
ccrcvisiae: frequency, effects, and dominance. PLoS Genet 8: el002497. 
doi: 10. 1 .37 1 /journal.pgen. 1 002497. 

11. Landry CR, Lemos B, Rilkin SA, Dickinson WJ, Hartl DL (2007) Genetic 
properties intlucncing the evolvability of gene expression. Science 317: 1 18-121. 
doi:10.1 126/scicncc.l 140247. 

12. Davidson EH (2001) Genomic Regulatory Systems: Development and 
Evolution. Academic Press. 

1.3. Pollard DA, Moses AM, Iyer VN, Eiscn MB (2006) Detecting the limits of 
regulatory element conservation and divergence estimation using pairwise and 
multiple alignments. BMC Bioinformatics 7: 376. doi: 1 0. 1 186/ 147 1 -2105-7-376. 

14. Ho MCW, Johnscn H, Goctz SE, Schiller BJ, Bac E, ct al. (2009) Functional 
evolution of cis-regulatory modules at a homeotic gene in Drosophila. PLoS 
Genet 5: cl000709. doi:10.1371/journal.pgen.l000709. 

15. Brown CD, Johnson DS, Sidow A (2007) Functional Architecture and Evolution 
of Transcriptional Elements That Drive Gene Cocxpression. Science 317: 1557— 
1560. doi:10.1126/science.l 145893. 

16. Cameron RA, Davidson EH (2009) Flexibility of transcription factor target site 
position in conserved cis-regulatory modules. Dcv Biol 336: 122-135. 
doi: 1 0. 10 16/j.ydbio.2009.09.0 1 8. 

17. Gordon KL, Ruvinsky I (2012) Tempo and mode in evolution of transcriptional 
regulation. PLoS Genet 8: el002432. doi:10.1.371/journal.pgen.l002432. 

18. Wang X, Chamberlin HM (2002) Multiple regulatory changes contribute to the 
evolution of the Cacnorhabditis lin-48 ovo gene. Genes Dev 16: 2345-2349. 
doi:10.1101/gad.996302. 

19. Wang X, Greenberg JF, Chamberlin HM (2004) Evolution of regulatory 
elements producing a conserved gene expression pattern in Cacnorhabditis. Evol 
Dcv 6: 237-245. doi: 10.1 1 1 1/j.1525-142X.2004.04029.x. 

20. Marccllini S, Simpson P (2006) Two or four bristles: functional evolution of an 
enhancer of scute in Drosophilidae. PLoS Biol 4: e386. doi:10.1371/joumal.- 
pbio.0040386. 

21. Sung H-M, Wang T-Y, Wang D, Huang Y-S, Wu J-P, ct al. (2009) Roles of 
trans and cis variation in yeast intraspecies evolution of gene expression. Mol 
Biol Evol 26: 2533-2538. doi:10.1093/molbev/mspl71. 

22. Barrierc A, Gordon KL, Ruvinsky I (2012) Coevolution within and between 
Regulatory Loci Can Preserve Promoter Function Despite Evolutionary Rate 
Acceleration. PLoS Genet 8: cl002961. doi:10.1371/journal.pgen.l002961. 



23. Chiang DY, Moses AM, Kellis M, Lander ES, Eisen MB (200.3) Phylogenetically 
and spatially conserved word pairs associated with gene-expression changes in 
yeasts. Genome Biol 4: R43. doklO.l 186/gb-200.3-4-7-r43. 

24. Grad YH, Roth FP, Halfon MS, Church GM (2004) Prediction of similarly 
acting cis-regulatory modules by subsequence profiling and comparative 
genomics in Drosophila mclanogastcr and D.pscudoobscura. Bioinformatics 
20: 2738-2750. doi:10.1093/bioinformatics/bth320. 

25. Rebeiz M, Castro B, Liu F, Yue F, PosakonyJW (2012) Ancestral and conserved 
cis-regulatory architectures in developmental control genes. Dev Biol .362: 282— 
294. doi: 1 0. 1 0 1 6/j.ydbio.20 11.12.011. 

26. Hare EE, Peterson BK, Iyer VN, Meier R, Eisen MB (2008) Scpsid even-skipped 
enhancers are functionally conserved in Drosophila despite lack of sequence 
conservation. PLoS Genet 4: cl000106. doi: 10. 1371 /journal.pgen. 1000 106. 

27. Cande J, Goltscv Y, Levinc MS (2009) Conservation of enhancer location in 
divergent insects. Proc Natl Acad Sci 106: 14414-14419. doi:10.1073/ 
pnas.0905754106. 

28. Pan D, Valentine SA, Courey AJ (1994) The bipartite D. melanogaster twist 
promoter is reorganized in D. virilis. Mcch Dcv 46: 41—53. 

29. Hauenschild A, Ringrosc L, Altmutter C, Paro R, Rehmsmeier M (2008) 
Evolutionary plasticity of polycomb/ trithorax response elements in Drosophila 
species. PLoS Biol 6: c261. doi:10.1371/joumal.pbio.0060261. 

30. Kalay G, Wittkopp PJ (2010) Nomadic enhancers: tissue-specific cis-regulatory 
elements of yellow have divergent genomic positions among Drosophila species. 
PLoS Genet 6: cl001222. doi:10.1371/journal.pgen.l001222. 

31. Frankcl N, Erczyilmaz DF, McGregor AP, Wang S, Payrc F, ct al. (2011) 
Morphological evolution caused by many subtle-effect substitutions in regulatory 
DNA. Nature 474: 598-603. doi:10.1038/naturel0200. 

32. Ludwig MZ, Bergman C, Patel NH, Kreitman M (2000) Evidence for stabilizing 
selection in eukaryotic enhancer element. Nature 403: 564—567. 

3.3. Compel N, Prud'homme B, Wittkopp PJ, Kassncr VA, Carroll SB (2005) 
Chance caught on the wing: cis-regulatory evolution and the origin of pigment 
patterns in Drosophila. Nature 433: 481-487. doi:10.1038/nature03235. 

34. Prud'homme B, Compel N, Rokas A, Kassncr VA, Williams TM, et al. (2006) 
Repeated morphological evolution through cis-regulatory changes in a 
plciotropic gene. Nature 440: 1050-1053. doi:10.1038/nature04597. 

35. True JR, Haag ES (2001) Developmental system drift and flexibility in 
evolutionary trajectories. Evol Dev 3: 109-119. 

36. Sulston JE, Horvitz HR (1977) Post-cmbryonic cell lineages of the nematode, 
Cacnorhabditis clcgans. Dev Biol 56: 1 10-156. 

37. Sulston JE, Sehiercnberg E, White JG, Thomson JN (1983) The embryonic cell 
lineage of the nematode Cacnorhabditis elegans. Dev Biol 100: 64-119. 

38. Zhao Z, Boyle TJ, Bao Z, Murray JI, Mericle B, et al. (2008) Comparative 
analysis of embryonic cell lineage between Cacnorhabditis briggsae and 
Cacnorhabditis elegans. Dcv Biol 314: 93-99. doi:10.1016/j.ydbio.2007.1 1.015. 

39. Elcmento O, Tavazoic S (2005) Fast and systematic genome-wide discovery of 
conserved regulatory elements using a non-alignment based approach. Genome 
Biol 6: R18. doi:10.1186/gb-2005-6-2-rl8. 

40. Kiontke K, Fitch DHA (2005) The phylogcnctic relationships of Cacnorhabditis 
and other rhabditids. WormBook : the online review of C. elegans biology, pp. 
1 - 1 1 . doi: 1 0. 1 895 /wormbook. 1 . 1 1 . 1 . 

41. Tirosh I, Barkai N (2011) Inferring regulatory mechanisms from patterns of 
evolutionary divergence. Mol Syst Biol 7: 530. doi: 10. 1038/msb.201 1.60. 

42. Landry CR, Wittkopp PJ, Taubcs CH, Ranz JM, Clark AG, ct al. (2005) 
Compensatory cis-trans evolution and the dysregulation of gene expression in 
interspecific hybrids of Drosophila. Genetics 171: 1813-1822. doi:10.1534/ 
gcnetics.105. 047449. 

43. Wittkopp PJ, Hacrum BK, Clark AG (2008) Regulatory changes underlying 
expression differences within and between Drosophila species. Nat Genet 40: 
346-350. doi:10.1038/ng.77. 



PLOS Genetics | www.plosgenetics.org 



13 



June 2014 | Volume 10 | Issue 6 | e1004435 



Patterns of Regulatory Divergence in Nematodes 



44. Tirosh I, Rcikhav S, Levy AA, Barkai N (2009) A yeast hybrid provides insight 
into the evolution of gene expression regulation. Science 324: 659-662. 
doi:10.1 126/scicncc.l 169766. 

45. Hillier LW, Miller RD, Baird SE, Chinwalla A, Fulton LA, ct al. (2007) 
Comparison of C. elegans and C. briggsac genome sequences reveals extensive 
conservation of chromosome organization and synteny. PLoS Biol 5: el67. 
doi: 10.1371 /journal.pbio.0050 167. 

46. Barriere A, Yang S-P, Pekarek E, Thomas CG, Haag ES, et al. (2009) Detecting 
heterozygosity in shotgun genome assemblies: Lessons from obligately 
outcrossing nematodes. Genome Res 19: 470—480. doi: 10.1 101/ 
gr.081851. 108.470. 

47. Kennedy BP, Aamodt EJ, Allen FL, Chung MA, Heschl MFP, et al. (1993) The 
Gut Esterase Gene (ges-1) From the Nematodes Caenorhabditis elegans and 
Caenorhabditis briggsae. J Mol Biol 229: 890-908. 

48. Maduro MF, Pilgrim D (1996) Conservation of function and expression of unc- 
119 from two Caenorhabditis species despite divergence of non-coding DNA. 
Gene 183: 77-85. 

49. Ruvinsky I, Ruvkun G (2003) Functional tests of enhancer conservation between 
distantly related species. Development 130: 5133-5142- doi:10.1242/dev.007 1 1. 

50. Petalcorin MIR, Joshua GW, Agapow P-M, Dolphin CT (2005) The frno genes 
of Caenorhabditis elegans and C. briggsae: characterisation, gene expression and 
comparative genomic analysis. Gene 346: 83—96. doi:10.1016/ 
j.gene.2004.09.02L 

51. Marri S, Gupta BP (2009) Dissection of lin-11 enhancer regions in 
Caenorhabditis elegans and other nematodes. Dev Biol 325: 402-411. 
doi:10.1016/j.ydbio.2008.09.026. 

52. Kiontke KC, Felix M-A, Ailion M, Rockman M V, Braendle C, et al. (201 1) A 
phylogeny and molecular barcodes for Caenorhabditis, with numerous new 
species from rotting fruits. BMC Evol Biol 11: 339. doi: 10.1 186/ 1471-2148-1 1- 
339. 

53. Kiontke K, Gavin NP, Raynes Y, Roehng C, Piano F, et al. (2004) 
Caenorhabditis phylogeny predicts convergence of hermaphroditism and 
extensive intron loss. Proc Natl Acad Sci 101: 9003-9008. doi: 10. 1073/ 
pnas.0403094101. 

54. Poinar Jr. GO (2011) The evolutionary history of nematodes. Leiden, the 
Netherlands: Brill. 

55. Jin Y, Jorgensen EM, Hartwieg E, Horvitz HR (1999) The Caenorhabditis 
elegans gene unc-25 encodes glutamic acid decarboxylase and is required lor 
synaptic transmission but not synaptic development. J Neurosci 19: 539—548. 

56. Schuske K, Palfrcyman MT, Watanabc S, Jorgensen EM (2007) UNC-46 is 
required for trafficking of the vesicular GABA transporter. Nat Neurosci 10: 
846-853. doi:10.1038/nnl920. 

57. Mclntire SL, Rcimer RJ, Schuske K, Edwards RH, Jorgensen EM (1997) 
Identification and characterization of the vesicular GABA transporter. Nature 
389: 870-876. doi:10. 1038/39908. 

58. Cinar H, Kcles S, Jin Y (2005) Expression profiling of GABAcrgic motor 
neurons in Caenorhabditis elegans. Curr Biol 15: 340-346. doi:10.1016/ 
j.cub.2005.02.025. 

59. Jansen G, Thijsscn K, Werner P (1999) The complete family of genes encoding 
G proteins of Caenorhabditis elegans. Nat Genet 21: 414—419. 

60. Ranganathan R, Sawin ER, Trent C, Horvitz HR (2001) Mutations in the 
Caenorhabditis elegans serotonin reuptake transporter MOD-5 reveal serotonin- 
dependent and -independent activities of fluoxetine. J Neurosci 21: 5871-5884. 

61. Locr CM, Rivard L (2007) Evolution of Neuronal Patterning in Free-Living 
Rhabditid Nematodes I : Sex- Specific Serotonin-Containing Neurons. J Comp 
Neurol 502: 736-767. doi:10.1002/cne. 

62. Berdichevsky A, Nedelcu S, Boulias K, Bishop NA, Guarente L, et al. (2010) 3- 
Ketoacyl thiolase delays aging of Caenorhabditis elegans and is required for 
lifespan extension mediated by sir-2.1. Proc Natl Acad Sci 107: 18927-18932. 
doi:10.1073/pnas.l013854107. 

63. Mak HY, Nelson LS, Basson M, Johnson CD, Ruvkun G (2006) Polygenic 
control of Caenorhabditis elegans fat storage. Nat Genet 38: 363—368. 
doi:10.1038/ngl739. 

64. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I (2004) VISTA: 
computational tools for comparative genomics. Nucleic Acids Res 32: W273-9. 
doi:10.1093/nar/gkh458. 

65. Ludwig MZ, Palsson A, Aleksecva E, Bergman CM, Nathan J, et al. (2005) 
Functional evolution of a cis-regulatory module. PLoS Biol 3: c93. doi: 10. 1 37 1 / 
journal.pbio.0030093. 

66. Crocker J, Tamori Y, Erives A (2008) Evolution acts on enhancer organization 
to fine-tune gradient threshold readouts. PLoS Biol 6: c263. doi: 1 0. 1 37 1 / 
journal.pbio.0060263. 

67. Wunderlich Z, Bragdon MD, Eckcnrode KB, Lydiard-Martin T, Pearl- 
Waserman S, et al. (2012) Dissecting sources of quantitative gene expression 
pattern divergence between Drosophila species. Mol Syst Biol 8: 604. 
doi:10.1038/msb.2012.35. 

68. Mclntire SL, Jorgensen EM, Kaplan J, Horvitz HR (1993) The GABAcrgic 
nervous system of Caenorhabditis elegans. Nature 364: 337-341. 

69. Guastclla J, Johnson CD, Stretton AO (1991) GABA-immunoreactive neurons 
in the nematode Ascaris. J Comp Neurol 307: 584-597. doi: 10. 1002/ 
cne.903070406. 

70. Rivard L, Snnivasan J, Stone A, Ochoa S, Sternberg PW, et al. (2010) A 
comparison of experience-dependent locomotory behaviors and biogenic amine 



neurons in nematode relatives of Caenorhabditis elegans. BMC Neurosci 1 1: 22. 
doi: 10.11 86/ 1 47 1 -2202- 11-22. 

71. McManus CJ, CoolonJD, O'DuffM, Eipper-MainsJ, Graveley BR, et al. (2010) 
Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Res 20: 
816-825. doi:10.1101/gr.l02491.109. 

72. Fowlkcs CC, Eckcnrode KB, Bragdon MD, Meyer M, Wunderlich Z, ct al. 

(2011) A conserved developmental patterning network produces quantitatively 
different output in multiple species of Drosophila. PLoS Genet 7: el002346. 
doi: 10.1371 /journal.pgen. 1 002346. 

73. Tirosh I, Weinberger A, Bezalcl D, Kaganovich M, Barkai N (2008) On the 
relation between promoter divergence and gene expression evolution. Mol Syst 
Biol 4: 159. doi:10.1038/msb4100198. 

74. McGaughey DM, Vinton R, Huynh J (2008) Metrics of sequence constraint 
overlook regulator)' sequences in an exhaustive analysis at phox2b. Genome Res 
18: 252-260. doi: 10. 1 101/gr.6929408.1. 

75. Ritter DI, Li Q, Kostka D, Pollard KS, Guo S, ct al. (2010) The importance of 
being cis: evolution of orthologous fish and mammalian enhancer activity. Mol 
Biol Evol 27: 2322-2332. doi:10.1093/molbev/msql28. 

76. Takahashi H, Mitani Y, Satoh G, Satoh N (1999) Evolutionary alterations of the 
minimal promoter for notochord-specific Brachyury expression in ascidian 
embryos. Development 126: 3725—3734. 

77. Oda-Ishii I, Bcrtrand V, Matsuo I, Lemaire P, Saiga H (2005) Making very 
similar embryos with divergent genomes: conservation of regulatory mechanisms 
of Otx between the ascidians Halocynthia rorctzi and Ciona intcstinalis. 
Development 132: 1663-1674. doi:10.1242/dev.01707. 

78. Dutilh BE, Huynen MA, Snel B (2006) A global definition of expression context 
is conserved between orthologs, but does not correlate with sequence 
conservation. BMC Genomics 7: 10. doi: 1 0. 1 1 86/ 1 47 1 -2 1 64-7- 1 0. 

79. Fisher S, Grice EA, Vinton RM, Bessling SL, McCallion AS (2006) 
Conservation of RET regulatory function from human to zcbrafish without 
sequence similarity. Science 312: 276-279. doi:10.1 126/science.l 124070. 

80. Cooper GM, Brown CD (2008) Qualifying the relationship between sequence 
conservation and molecular function. Genome Res 18: 201—205. doi: 10. 1 101 / 
gr.7205808.4. 

81. Taher L, McGaughey DM, Maragh S, Aneas I, Bessling SL, et al. (2011) 
Genome-wide identification of conserved regulatory function in diverged 
sequences. Genome Res 21: 1139-1149. doi: 10. 1 101/gr. 1 19016.1 10. 

82. Swanson CI, Schwimmcr DB, Barolo S (201 1) Rapid Evolutionary Rewiring of 
a Structurally Constrained Eye Enhancer. Curr Biol 21: 1-11. doi:10.1016/ 
j.cub.20 11.05.056. 

83. Nelson AC, Wardle FC (2013) Conserved non-coding elements and cis 
regulation: actions speak louder than words. Development 140: 1385-1395. 
doi:10.1242/dev.084459. 

84. Bullaughey K (201 1) Changes in selective effects over time facilitate turnover of 
enhancer sequences. Genetics 187: 567-582. doi:10. 1534/gcnetics. 1 10. 121590. 

85. Erives A, Levinc M (2004) Coordinate enhancers share common organizational 
features in the Drosophila genome. Proc Nad Acad Sci 101: 3851-3856. 
doi:10.1073/pnas.0400611101. 

86. Crocker J, Erives A (2008) A closer look at the eve stripe 2 enhancers of 
Drosophila and Themira. PLoS Genet 4: el000276. doi:10.1371/journal.p- 
gen. 1000276. 

87. Kim J, Cunningham R, James B, Wydcr S, Gibson JD, ct al. (2010) Functional 
characterization of transcription factor motifs using cross-species comparison 
across large evolutionary distances. PLoS Comput Biol 6: el 000652. 
doi: 10.1371 /journal.pcbi. 1 000652. 

88. Swanson CI, Evans NC, Barolo S (2010) Structural rules and complex 
regulatory circuitry constrain expression of a Notch- and EGFR-rcgulatcd eye 
enhancer. Dev Cell 18: 359-370. doi:10.1016/j.dcvcel.2009.12.026. 

89. Liberman LM, Stathopoulos A (2009) Design flexibility in cis-rcgulatory control 
of gene expression: synthetic and comparative evidence. Dev Biol 327: 578-589. 
doi:10.1016/j.ydbio.2008. 12.020. 

90. Ruvinsky I, Ohler U, Burgc CB, Ruvkun G (2007) Detection of broadly 
expressed neuronal genes in C. elegans. Dev Biol 302: 617-626. doi:10.1016/ 
j.ydbio.2006.09.014. 

91. Hobcrt O (201 1) Regulation of terminal differentiation programs in the nervous 
system. Annu Rev Cell Dev Biol 27: 681-696. doi:10.1 146/annurcv-ccllbio- 
092910-154226. 

92. Boulin T, Etchbcrger JF, Hobcrt O (2006) Reporter gene fusions. WormBook : 
the online review of C. elegans biology, pp. 1-23. doi:10.1895/wormbook. 
1.106.1. 

93. Stern DL, Orgogozo V (2009) Is genetic evolution predictable? Science 323: 
746-751. 

94. Rajakumar R, San Mauro D, Dijkstra MB, Huang MH, Wheeler DE, et al. 

(2012) Ancestral developmental potential facilitates parallel evolution in ants. 
Science 335: 79-82. doi:10. 1 126/scicnce. 121 1451. 

95. Rogers WA, Salomone JR, Tacy DJ, Camino EM, Davis KA, et al. (2013) 
Recurrent modification of a conserved cis-rcgulatory element underlies fruit fly 
pigmentation diversity. PLoS Genet 9: cl003740. doi:10.1371/journal.pgen. 
1003740. 

96. Xu S, Fire A, Seydoux G, Okkema P (1995) Fire Lab Vector Kit - June 1995. 

97. Granato M, Schnabel H, Schnabcl R (1994) pha-1, a selectable marker for gene 
transfer in C. elegans. Nucleic Acids Res 22: 1762-1763. 



PLOS Genetics | www.plosgenetics.org 



14 



June 2014 | Volume 10 | Issue 6 | e1004435 



Patterns of Regulatory Divergence in Nematodes 



98. Kelly W, Xu S, Montgomery M, Fire A (1997) Distinet requirements for somatic 
and germline expression of a generally expressed Caenorhabditis elegans gene. 
Genetics 146: 227-238. 

99. Woodruff GC, Eke O, Baird SE, Felix M-A, Haag ES (2010) Insights into 
species divergence and the evolution of hermaphroditism from fertile interspecies 



hybrids of Caenorhabditis nematodes. Genetics 186: 997-1012. doi:10.1534/ 
genetics. 11 0.1 20550. 

100. Barriere A, Gordon KL, Ruvinsky I (2011) Distinct Functional Constraints 
Partition Sequence Conservation in a cis-Rcgulatory Element. PLoS Genet 7: 
e 1 002095. doi: 10.1371 /journal.pgen. 1 002095. 



PLOS Genetics | www.plosgenetics.org 



15 



June 2014 | Volume 10 | Issue 6 | e1004435 



